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In this work we examine the properties of a recently described ordinary 
. differential equation that relates the age-specific prevalence of a chronic dis- 

ease with the incidence and mortalities of the diseased and healthy persons. 
The equation has been used to estimate the incidence from prevalence data, 
which is an inverse problem. The ill-posedness of this problem is proven, too. 
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1 Introduction 



Recently a novel ordinary differential equation (ODE) has been described that relates the 
age-specific prevalence of an irreversible disease wit h its incidence rate and the mortality 



rates of the diseased and the non-diseased persons ( jBrinksl . 120111 ). This article is about 
properties of the ODE and its solutions. 

Given the mortality rates and the age-specific prevalence, the ODE may be used to 
derive the incidence rate§, which can be interpreted as an inverse problem. Inverse 



^n this article the expressions rate and density are synonymously used. 
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problems are often examined with respect to ill- or well-posedness. A well-posed of an 
inverse problem in the sense of J acques Hadamard means that a solution exists, that 
the solution is unique and stable ( IHadamardl . Il923l ). In this article the ill-posedness of 
the inverse problem is proven. 

The article is organized as follows. Section |2] briefly reviews the derivation of the 
ODE. Then, some properties of the ODE and its solution are examined. In Section |3] 
the inverse problem is introduced and the ill-posedness is proven. Finally, in Section H] 
the results and its consequences are discussed. 



2 The ODE: Derivation and Properties 



A popular framework for studying relations between prevalence and incidence of a disease 
is the simple model consist i ng of three states as depic ted in Figure [TJ Normal, Disease 



and Death, (IKeidina. 



1991 



Murray and Lopezl . Il994l ). This model goes back at least 



until the 1950s (IFix and Neymanl . Il95ll ). In general, the transition densities from one 
state to another depend on calendar time t and age a. The transition density from 
Disease to Death may also depend on the duration d of the disease. 

People in the population under consideration can get a disease at incidence density 
i, they can die either after having got the disease at age-specific mortality rate ui\ or 
without having the disease at mortality rate tjiq. The numbers of the individuals in 
the Normal and in the Disease state are denoted by S (susceptibles) and C (cases). 
Generally, these numbers as well as the mortality and incidence densities depend on the 
calendar time t. Sometimes, these quantities are a ssumed to be in dependent of time t, 
which is referred to as the time-homogeneous case (IKeidingi . 119911 ) . 

Assuming time-homogeneity and di sease related mortality mi to be independent of 
duration d, Murray and Lopez (119941 ) described transitions along the paths in Figure 
Q]as a set of ODEs. Henceforth, beside time-homogeneity we additionally assume that 
the population is closed (no migration) and has a constant birth rate. Furthermore, 
the age-specific functions i, Too, TOi are non-negative and continuous in [0,u] for some 
u) > 0. Henceforth, u is considered as the minimal age when all members (diseased and 
non-diseased) of the population are deceased^]. Then, Equation (TjQ) describes the change 
rates of the numbers S and C of normal and diseased individuals, respectively. 



— = - (i(a) + mo (a)) ■ S 

— — = i(a) ■ S — TOi(a) • C 
da 



(1) 



2 For example by choosing u — inf{a > 0| C(a) + S(a) < 0.5}. 
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Figure 1: Three states model of normal, diseased and dead subjects. Transition densities 
may depend on calender time t, age a, and duration d of the disease. 
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The resulting set of ODEs is linear and of first order. Due to the simple structure 
of the ODEs for given age-specific incidence and mortality rates i, m and mi, the 
analytical solution of the corresponding initial value problem with initial conditions 
S(0) = S > 0, C(0) = C > 0, S Q + C > can be calculated easily: 



S(a) = So ■ exp I — / i(r) + m (r)d7 
\ o 

C{a) = exp I - J mi(r)dT J • < C + / i{r)S{r) exp I J mi (t)dt J dr I . 



(2) 



Obviously, from S + C > it follows that S(a) > 0, C(a) > and S(a) + C(a) > 
for all a G [0, u\. 

Usually, for a population under consideration the overall (general) mortality density m 
is observed or reported in life-tables. The mortality density m is a convex combination 
of the mortality density mo of the normals and the mortality density mi of the diseased: 



m(a) = p(a) ■ mi (a) + {1 — p (a)} • m (a) 

= m Q (a) • {p (a) ■ (R (a) - 1) + 1} , (3) 

where i?(a) is the relative risk, i? = In this expression p is the prevalence of the 
disease, which for a specific age a and S(a) + C(a) > can be written as 

P(a) = C ^ ] r( ■ (4) 
S(a) + G (a) 

Equation ([2]) allows the application of the ODE-system ([T]) in the case when m and 
mi are unknown. In epidemiology this typically is the case. Then the ODE-system ([T]) 
becomes non-linear and does not have an analytical solution anymore. 

Interestingly, the two-dimensional system can be reduced to a one-dimensional ODE, 
which is stated in the following Theorem. This can easily by derived from the quotient 
rule and Equation ([T]). 

Theorem 2.1. Let mortality densities m,mo € C°([0,o;]) and S,C G C 1 ([0,o;]) with 
S(a) + C(a) > for all a G [0, to] , then p = is differentiable in [0, to] and it holds 

dp 

— = (1 -p) ■ [i - (m-mo)) . (5) 
da 
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Table 1: Types of the ODE (jSJ) depending on the given mortality. 



Given mortality 



Right hand side of Eq. 



Type of the ODE 



m, mo 
m, mi 
m ,mi 
m , R 
nil, R 
m, R 



(1-p) 
(1-p) 

(1-p) 
(1-p) 
(1-p) 
(1-p) 



i — (m — mo)] 



mi—m 
1-p 



i — p ■ [nil — mo)] 
i — p ■ niQ ■ (R — 1)} 
i — p ■ mi ■ 

gCg-jJ 



m 



p(R-l)+l 



Linear 
Linear 

Riccati 
Riccati 
Riccati 
Abelian 



Depending on the type of information about the mortality densities, the ODE §5§ 
changes its type (Tabled]), which is important when solving the ODE. In case the ODE 
i s linear , an ea sy analytical solution exists. If the ODE is of Riccati or Abelian type 
(jKamkd . 119831 ). a general analyt i cal so lution does not exist. An extensive monograph 
about Riccati equations is (jReidJ, Il972l ). 



The fractions -^M- and 



y(fl-i) 



/; '""^ p .fjt-i)+ i * n ^ e ^ as ^ ^ W0 rows > are ver y wen known in epidemi- 
ology. These are the expo sition attributable risk (EAR) and the population attributable 



risk (PAR), respectively, ( jWoodwardl . 120051 ). 



Next it is examined, if the solutions of the one- dimensional ODE ([5]) are epidemiolog- 
ically meaningful, i.e. p(a) £ [0, 1] for all a £ [0, a;]. For the system ([1]) this is clear: the 
age-specific prevalence p(a) = s ^^c(a) & ven by ^ ne solutions 02]) are meaningful for all 
a £ [0,o;]. However, it is not obvious that solutions p of (EJ) are between and 1. For 
the special case that m = nil - this case is called non- differential mortality - it can be 
proven directly. Then the solution of © with initial condition p(0) = p £ [0, 1] is 



p(a) 



1-(1 



Po) ■ exp 



i(r)dr 



and the epidemiological meaningfulness follows immediately. In case of differential mor- 
tality (mo 7^ nil) epidemiological meaningfulness cannot not be proven directly, because 
it has to include all the different cases of the right hand side of (jSJ) in Table [U In- 
stead of a direct proof we use the correspondence between Equations ([T]) and Let 
N(a) := C(a) + S(a) denote the number of persons alive at age a, then it holds N(a) > 
for all a £ [0,a>]. We augment Equation (]5]) by another ODE in with m defined in 
Equation 
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— = (1 - p) • {% - (to - mo)) 

da / 6 n 

dN Ar W 

— ■ — = — m • iv. 
da 



Then we have the following correspondence between the ODE-systems ([T]) and 

C(a) 
S(a)+C(a) 



Theorem 2.2. (A) If S(a),C(a) are solutions of (pQ) ; £nen p(a) := 9 , tt% ] and 



N(a) := C(a) + 5(a) are solutions of 

(B) Ifp(a),N(a) are solutions of §6§, then S(a) := {1 — p(a)} • iV(a) and C(a) := 
p(a) • N(a) are solutions of ([1]). 

Proof. This is an easy exercise in calculus. □ 

From Theorem 12. 21 (B) it follows that p(a) G [0, 1] for all a G [0, u] : If p(a) is a solution 

CM 

N(a)- 



of ([HD, then it has a representation p(a) = jttt- Since < C(a) < N(a), the solution p 



is epidemiologically meaningful. 

Remark 2.1. From Theorem \2.B (A) it is obvious that ODE-system ([T]) implies ^ = 
— m • N . This is the defining equation o f a stationary popu l ation which is a population 



with a special type of age distribution ({Preston and CoalA . \1982l) . Hence, ([!]) is valid 



only for stationary populations. Since most populations are non- stationary, this is a 
heavy limitation. However, it can be shown that §5§ holds true in general populations as 
long as certain restrictions on the migration rate are fulfilled. Details are not subject of 
this work and will be elaborated in a subsequent paper. 



3 The Inverse Problem 



A key application for the ODE ([5]) is the derivation of the age-specific incidence rate i(a) 
from p(a) if the mortalities (or any equivalent information in the first column of Tabled]) 
are known. In epidemiology, typically incidences rates are surveyed in follow-up studies, 
which may be very lengthy and costly. If the model assumptions for ODE ()5]) hold true, 
the equation can be solved for i(a). Beside mortality information, the age course of the 
prevalence has to be known, whic h can be obtai ned from relatively cheap cross- sectional 
studies. An example is shown in ( iBrinksl . 1201 it ) . 

In such an application with given mortalities, we conclude from an effect (the preva- 
lence) to the underlying c ause (the incidence), which can be interpreted as an inverse 
problem ( Tarantolal . 2005 ). The inverse problem is opposed to the direct problem of 
inferring from the incidence (i.e. the cause) to the prevalence (the effect) by ODE 
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5]) . Now we show that the inverse problem is ill-posed in the sense of IHadamard 
19231 ). For given (sufficiently smooth) mortalities and po G [0, 1] define the operator 
p : C°([0,co]) —> C 1 ([0,o;]),2 H- such that p(0) = p and p is the solution of To 

+ m — itlq is dis- 



dp/da 
1-p 



show that the inverse problem is ill-posed we prove that p~ : p i— > 
continuous. It is sufficient to show this for the special case of non-differential mortality 
(m = mo). Let C k ([0, u]), k — 0, 1, be equipped with the maximum norm ||-||. Choose 

< e 



p G C 1 ([0,o;]), e > and define p e ,„(a) 
and 



p(a) + e • sin(n • a). Then, it is \\p — p e 



dp 1 



dp e 



da 1 — p da 1 — p e 



esin(n-) + encos(n-)(l — p) 



(1 — p)(l — p — esin(n-)) 

For e n := n _1//2 and p(a) ^ 1 the term e n ncos(na)(l — p(a)) is unbounded as n — > oo, 
which implies that p~ l is discontinuous and the inverse problem is ill-posed. 



4 Discussion 



By extending the framework of (IMurray and Lopea . Il994l ) for studying the relation be- 



tween prevalence and incidence, it had been found that prevalence, incidence and mor- 
tality are linked by a simple one-dimensional ODE. In this article it has been shown that 
the solutions of this ODE are epidemiologically meaningful. Depending on the type of 
mortality information available, the ODE changes its type, which has implications about 
existence of general analytical solutions. In many epidemiologically relevant cases, an 
analytical solution does not exist, and numerical treatment has to used instead. 

An important application of the ODE is the derivation of age-specific incidences from 
the age distribution of the prevalence. This article shows that this question can be 
interpreted as an inverse problem. Furthermore, the inverse problem is ill-posed. The 
proof of the ill-posedness shows that an additive high frequency distortion (esin(n-)) of 
the prevalence may lead to an unbounded inaccuracy in the derived incidence. How- 
ever, high frequency distortions might be unlikely in real chronic diseases. Hence, the 
consequences in practical epidemiology are unclear so far. 

In the discussed ODE model, several assumptions have been made. The ODE is 
valid only if incidence and mortality rates are independent from calendar time. Due 
to changes in medical progress, hygiene, nutrition and lifestyle, mortality does undergo 
secular trends. Thus, it is appropriate to formulate Equation ([5]) as a partial differential 
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equation, which is subject of a subsequent paper. Moreover, in real diseases mortality 
of the diseased persons depend on the duration of the disease. An ex ample is diabetes 



wher e the relative mortality over the diabetes duration is U-shaped (jCarstensen et al 



20081 ). Du ration depende ncy obfuscates the relation between prevalence, incidence and 



mortality f lKeidingl . Il99lh . Results as easy as presented here are unlikely not be expected. 

A last note refers to the term chronic disease: In this article, chronic means irre- 
versible, i.e. there is no way back from the Disease to the Normal state. However, most 
of the results presented here remain true, if there is remission back to state Normal. 
Then , the fundame ntal ODE (J3]) has an additional term that depends on the remission 
rate jBrinkj , MLm) ■ 
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